AITopics | zero-shot generalization ability

25b040c97a75021e57100648a20b1e10-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-25-2026, 03:28:45 GMT

agent, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

25b040c97a75021e57100648a20b1e10-Paper-Conference.pdf

Neural Information Processing SystemsApr-25-2026, 03:28:42 GMT

Add feedback

Self-OrganizedGroupforCooperativeMulti-agent ReinforcementLearning

Neural Information Processing SystemsFeb-18-2026, 23:37:11 GMT

The framework of centralized training with decentralized execution (CTDE) [8,28]isone ofthe popular frameworks for solving cooperative multi-agent tasks.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Netherlands > South Holland > Leiden (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.35)

Add feedback

25b040c97a75021e57100648a20b1e10-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-7-2026, 23:13:07 GMT

agent, conductor, dynamic team composition, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Self-Organized Group for Cooperative Multi-agent Reinforcement Learning

Neural Information Processing SystemsDec-23-2025, 22:16:13 GMT

Centralized training with decentralized execution (CTDE) has achieved great success in cooperative multi-agent reinforcement learning (MARL) in practical applications. However, CTDE-based methods typically suffer from poor zero-shot generalization ability with dynamic team composition and varying partial observability. To tackle these issues, we propose a spontaneously grouping mechanism, termed Self-Organized Group (SOG), which is featured with conductor election (CE) and message summary (MS). In CE, a certain number of conductors are elected every $T$ time-steps to temporally construct groups, each with conductor-follower consensus where the followers are constrained to only communicate with their conductor. In MS, each conductor summarize and distribute the received messages to all affiliate group members to hold a unified scheduling. SOG provides zero-shot generalization ability to the dynamic number of agents and the varying partial observability.

cooperative multi-agent reinforcement learning, name change, self-organized group, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.65)

Add feedback

Self-Organized Group for Cooperative Multi-agent Reinforcement Learning

Neural Information Processing SystemsOct-10-2024, 09:15:02 GMT

Centralized training with decentralized execution (CTDE) has achieved great success in cooperative multi-agent reinforcement learning (MARL) in practical applications. However, CTDE-based methods typically suffer from poor zero-shot generalization ability with dynamic team composition and varying partial observability. To tackle these issues, we propose a spontaneously grouping mechanism, termed Self-Organized Group (SOG), which is featured with conductor election (CE) and message summary (MS). In CE, a certain number of conductors are elected every T time-steps to temporally construct groups, each with conductor-follower consensus where the followers are constrained to only communicate with their conductor. In MS, each conductor summarize and distribute the received messages to all affiliate group members to hold a unified scheduling. SOG provides zero-shot generalization ability to the dynamic number of agents and the varying partial observability.

cooperative multi-agent reinforcement learning, self-organized group, zero-shot generalization ability, (2 more...)

Neural Information Processing Systems

Technology: